The Limitation of MapReduce: A Probing Case and a Lightweight Solution
نویسندگان
چکیده
MapReduce is arguably the most successful parallelization framework especially for processing large data sets in datacenters comprising commodity computers. However, difficulties are observed in porting sophisticated applications to MapReduce, albeit the existence of numerous parallelization opportunities. Intrinsically, the MapReduce design allows a program to scale up to handle extremely large data sets, but constrains a program’s ability to process smaller data items and exploit variable-degrees of parallelization opportunities which are likely to be the common case in general application. In this paper, we analyze the limitations of MapReduce and present the design and implementation of a new lightweight parallelization framework, MRlite. MRlite can efficiently process moderatesize data with dependences among numerous computational steps. In the mean time, the parallelization on each step emulates the MapReduce model. Hence, the MRlite framework can also scale up for large data sets if massive parallelism with minimal dependence exists. MRlite can significantly improve the flexibility and parallel execution performance for a number of typical programs. Our evaluation shows that MRlite is one order of magnitude faster than Hadoop on problems that MapReduce has difficulty in handling. Keywords-Distributed computing; Parallel architectures
منابع مشابه
Application of Phase Change Material (PCM) for Cooling Load Reduction in Lightweight and Heavyweight Buildings: Case Study of a High Cooling Load Region of Iran
The application of phase change material (PCM) for energy conservation purposes in the residential buildings was investigated in the present study. Two types of building in terms of materials as the lightweight building (LWB) and heavyweight building (HWB) located in a high cooling load demanding region of Iran were considered for the study. Different types of PCM from organic and inorganic cat...
متن کاملAn Incentive-Aware Lightweight Secure Data Sharing Scheme for D2D Communication in 5G Cellular Networks
Due to the explosion of smart devices, data traffic over cellular networks has seen an exponential rise in recent years. This increase in mobile data traffic has caused an immediate need for offloading traffic from operators. Device-to-Device(D2D) communication is a promising solution to boost the capacity of cellular networks and alleviate the heavy burden on backhaul links. However, dir...
متن کاملAdaptive Dynamic Data Placement Algorithm for Hadoop in Heterogeneous Environments
Hadoop MapReduce framework is an important distributed processing model for large-scale data intensive applications. The current Hadoop and the existing Hadoop distributed file system’s rack-aware data placement strategy in MapReduce in the homogeneous Hadoop cluster assume that each node in a cluster has the same computing capacity and a same workload is assigned to each node. Default Hadoop d...
متن کاملNazhvan Pavilion in Isfahan, Construction Technics and an Experience for Building a Lightweight Structure
The Coastal area of Nazhvan is located at the riverside of Zayandehrūd River in the western part of Isfahan. In the gardens of this area which are mostly orchards, woodlands full of fruitless trees, owners have constructed architectural spaces and pavilions. One of the prominent architectural spaces within this territory is a mill known as “Asyāb-e Nazhvān or Hājjī”. There stands a two-floor pa...
متن کاملNazhvan Pavilion in Isfahan, Construction Technics and an Experience for Building a Lightweight Structure
The Coastal area of Nazhvan is located at the riverside of Zayandehrūd River in the western part of Isfahan. In the gardens of this area which are mostly orchards, woodlands full of fruitless trees, owners have constructed architectural spaces and pavilions. One of the prominent architectural spaces within this territory is a mill known as “Asyāb-e Nazhvān or Hājjī”. There stands a two-floor pa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010